Graph Structured Normal Means Inference
نویسندگان
چکیده
This thesis addresses statistical estimation and testing of signals over a graph when measurements are noisy and high-dimensional. Graph structured patterns appear in applications as diverse as sensor networks, virology in human networks, congestion in internet routers, and advertising in social networks. We will develop asymptotic guarantees of the performance of statistical estimators and tests, by stating conditions for consistency by properties of the graph (e.g. graph spectra). The goal of this thesis is to demonstrate theoretically that by exploiting the graph structure one can achieve statistical consistency in extremely noisy conditions. We begin with the study of a projection estimator called Laplacian eigenmaps, and find that eigenvalue concentration plays a central role in the ability to estimate graph structured patterns. We continue with the study of the edge lasso, a least squares procedure with total variation penalty, and determine combinatorial conditions under which changepoints (edges across which the underlying signal changes) on the graph are recovered. We will shift focus to testing for anomalous activations in the graph, using the scan statistic relaxations, the spectral scan statistic and the graph ellipsoid scan statistic. We will also show how one can form a decomposition of the graph from a spanning tree which will lead to a test for activity in the graph. This will lead to the construction of a spanning tree wavelet basis, which can be used to localize activations on the graph.
منابع مشابه
Gibbs Sampling for Logistic Normal Topic Models with Graph-Based Priors
Previous work on probabilistic topic models has either focused on models with relatively simple conjugate priors that support Gibbs sampling or models with non-conjugate priors that typically require variational inference. Gibbs sampling is more accurate than variational inference and better supports the construction of composite models. We present a method for Gibbs sampling in non-conjugate l...
متن کاملComparing Mean Vectors Via Generalized Inference in Multivariate Log-Normal Distributions
Abstract In this paper, we consider the problem of means in several multivariate log-normal distributions and propose a useful method called as generalized variable method. Simulation studies show that suggested method has a appropriate size and power regardless sample size. To evaluation this method, we compare this method with traditional MANOVA such that the actual sizes of the two methods ...
متن کاملStructured Prediction of Generalized Matching Graphs
A structured prediction approach is proposed for completing missing edges in a graph using partially observed connectivity between n nodes. Unlike previous approaches, edge predictions depend on the node attributes (features) as well as graph topology. To overcome unrealistic i.i.d. edge prediction assumptions, the structured prediction framework is extended to an output space of directed subgr...
متن کاملEfficient probabilistic models for inference and learning
This project was concerned with enriching probabilistic models with structured knowledge representation. By a probabilistic model we mean any formalism that can be used to specify a complex probability distribution. For instance, a Bayesian network specifies a joint probability distribution over a tuple of random variables by means of a directed acyclic graph, in such a way that only the condit...
متن کاملStructured Label Inference for Visual Understanding
Visual data such as images and videos contain a rich source of structured semantic labels as well as a wide range of interacting components. Visual content could be assigned with fine-grained labels describing major components, coarse-grained labels depicting high level abstractions, or a set of labels revealing attributes. Such categorization over different, interacting layers of labels evince...
متن کامل